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ABSTRACT 

Context. In cosmic shear likelihood analyses the covariance is most commonly assumed to be constant in parameter space. Therefore, when 
calculating the covariance matrix (analytically or from simulations), its underlying cosmology should not influence the likelihood contours. 
Aims. We examine whether the aforementioned assumptions hold and quantify how strong cosmic shear covariances vary within a reasonable 
parameter range. Furthermore, we examine the impact on likelihood contours when assuming different cosmologies in the covariance. The 
final goal is to develop an improved likelihood analysis for parameter estimation with cosmic shear. 

Methods. We calculate Gaussian covariances analytically for 2500 different cosmologies. In order to quantify the impact on the parameter 
constraints we perform a likelihood analysis for each covariance matrix and compare the likelihood contours. To improve on the assumption of 
a constant covariance, we use an adaptive covariance matrix, which is continuously updated according to the point in parameter space where 
the likelihood is evaluated. As a side-efi"ect, this cosmology dependent covariance improves the parameter constraints. We examine this fact 
more closely using the Fisher-matrix formalism. In addition we quantify the impact of non-Gaussian covariances on the likelihood contours 
using a ray-tracing covariance derived from the Millennium simulation. In this ansatz we return to the approximation of a constant covariance 
matrix; in order to minimize the error due to this approximation, we develop the concept of an iterative likelihood analysis. 
Results. Covariances vary significantly within the considered parameter range. The cosmology assumed in the covariance has a non-negligible 
impact on the size of the likelihood contours. This impact increases with increasing survey size, increasing number density of source galaxies, 
decreasing ellipticity noise, and when taking non-Gaussianity into account. A proper treatment of this effect is therefore even more important 
for future surveys. In this paper we present methods to take cosmology dependent covariances into account. 
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1. Introduction 



Cosm ic shear, which w as firs t d etected in 2000 jBacon et al 



2000t iKaiser et alj I2OOOI: Ivan Waerbeke etaP l2000t 



tool in cosmology. Late st result s ( e.g. ivan Waerbeke et al.. 
20051; [Sernbolonietal.1 I2OO6I: iHoekstra et alj l2006t 



in iJoachimi et al.l (120081) who use a power spectrum approach 
which significantly reduces the computational effort in the 
calculation. This analytic expression has been use d for pa- 
rameter estimation in inany surveys (e.g. [v an Wae r beke et al 



— ' II I rameter estimation in many survevs (e.g. ^n Wae r beKe et al. 

| Wittmanetal. || 2000D , has recently progress_ed to an important ^ Isemboloni et allEoOel; iHoekstra et aPM . However, 



Schrabback et al.l2007l;lHetterscheidt et al.lbOOTtlMassev et al 
2007; Fu et al. 20081) akeadv indicate its great ability to con- 
strain cosmological parameters which will be enhanced in the 
future by large upcoming surveys like Pan-STARRS, KIDS, 
DES, Euclid or LSST. The improved quality of cosmic shear 
data must be accompanied with an accurate data analysis, free 
of assumptions which bias the results. Obtaining appropriate 
covariances is a crucial issue in this context of a precision 
cosmology likelihood analysis. Several methods are suggested 
in the literature and have been applied to cosmic shear data. 
An analytic expressio n for covariances assum ing a Gaussian 
shear field is derived in lSchneider et alj (l2002al) and confirmed 



the assumption of a Gau ssian shear field breaks dow n on 
small scales; according to iKilbinger & Schneiden (l2005b and 



Semboloni et al.l (120071) non-linear effects already become 
important at angul ar scales < 10 ar cmin. To account for non- 
Gaussianity, jSembolomeLalJ (I2OO7I) invent a calibration factor 
which is derived from a comparison of Gaussian to ray-tracing 
covarianc es. An applic ation of this method to real data can be 
found in dFu et al.ll200S) . A second approa ch is the derivation 



of the covariance matrix fr om the data (e.g. lHetterscheidt et al 
2OO7I ; iMassev et al.l 12007 ). Here, the covariance is calculated 
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via field-to-field variation which involves a separation of the 
data set into many independent subsamples. This might lead to 
a loss of information on large scales if the survey is not suf- 
ficiently large. Third, one can estimate the covariance matrix 
from ray-tracing simulations, a method which circumvents the 
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aforementioned loss in information. Although, in this method 
the covariance is again derived via field-to-field variation, we 
can choose a sufficiently large numerical simulation to create 
many independent subsamples of adequate size. 
Note, that the last two methods involve an estimation process 
in the determination of the covariance matrix, which means 
that the inverse is biased and one has to correct for this effect 



Jo 27r 



(3) 



(lAndersonl l2003t iHartlap et al.ll2007h . Nevertheless, deriving 
covariance matrices from ray-tracing simulations seems to be 
a promising method as it preserves all the information in the 
data and additionally takes the non-Gaussianity of the shear 
field into account. 

The analytic expression and the ray-tracing covariance assume 
a specific cosmological model in their derivation. So far, 
cosmic shear likelihood analyses treat the covariance matrix as 
constant in parameter space, hence its underlying cosmology 
is assumed not to influence the parameter constraints. It is the 
intention of this paper to check for this assumption and in case 
it does not hold, to present an improved likelihood formalism 
for future surveys. 

This paper is organized as follows. Section |2] summarizes the 
basic theoretical background of the cosmic shear two-point 
correlation function (2PCF) and its corresponding covariance. 
In Sect. [3] we derive a scaling relation for covariances, which 
can be used for a fast calculation of covariances for arbitrary 
cosmology. Furthermore, we examine how strongly the 
covariance depends on its underlying cosmological model. 
The impact on parameter constraints when assuming a fixed 
cosmology in the covariance is subject of Sect. H] whereas we 
present improvements on this assumption in Sect. |5] Here, 
we consider a likelihood analysis with an adaptive covariance 
matrix, i.e. the covariance is calculated individually for each 
point in parameter space where the likelihood is evaluated. 
In addition, we outline the concept of an iterative covariance 
matrix, i.e. several likelihood analyses are performed, where 
the covariance is updated in every iteration according to the 
maximum likelihood parameter set. Here, we also examine 
the impact of non-Gaussian covariances on the likelihood 
contours using a ray-tracing covariance matrix derived from 
the Millennium Simulation. We conclude in Sect. |6] 

2. Data vectors and covariances of cosmic shear 

In this section we briefly review the basics of the cosmic shear 
two-point correlation function and its corresponding covari- 
a nce matrix. For more details on th i s topic the reader is refer red 
to'Bartelmann & Schneider' ('200l');'Schneider et all (l2002a!|bl); 
kilbinger & Schneid er (2004); Joa chimi et al. (200^). 
To measure the shear signal we define as the connecting vec- 
tor of two points and specify tangential and cross-component 
of the shear y as 

rt = -Re(ye"-'^) and yx = -Im(re"2'^) , (1) 

where (p is the polar angle of 0. The 2PCFs depend only on the 
absolute value of 6. They are defined in terms of the shear and 
can be related to the power spectra Pe and f b dSchneider et al 
2002bh 



with J„ denoting the n-th order Bessel-function. In this paper 
we only consider E-modes, therefore we set Pb = from now 
on. Furthermore, we assume that the 2PCF is estimated in log- 
arithmic bins 1? of angular width A§. The covariance of the 
2PCF is defined as 

(i?„ := {{^^m - U§d) {^±(^j) - U^j))) ■ (4) 

We neglect the index ^ in the covariance for the rest of the 
paper as we only consider covariances of the 2PCF. As one al- 
ready sees from (|4]i the 2PCF has four different covariances, 
denoted as C++, C+-, C-+, C— . Only three of them are inde- 
pendent since C+_()?/, i?j) = C-+{§j,§i). Assuming a Gaussian 
shear field the covariance of the 2PCF can be calcul ated analyt- 
ically dSchneider et al.ll2002al:|joachimi et a02008l) . There, the 
covariance is decomposed into three terms, namely the cosmic 
variance term (V), the pure shot noise term (S), and the mixed 
term (M) 



C++(7?;,)?,) = y++ + M++ + 5, 



(5) 
(6) 
(7) 



The pure shot noise term vanishes in case of C+_ and only con- 
tributes to the diagonal of C++ and C— . It can be calculated 

as 



(8) 



In'&iMiAffl 



where A denotes the solid angle of the data field, cr^ is the in- 
trinsic ellipticity dispersion, and h the number density of source 
galaxies. The cosmic variance term (V) and the mixed term 
(M) can be either c alculated using the po wer spectrum or the 
2PCF. According to Joachimi et al.l ( 2008 ) the power spectrum 
approach leads to the following expressions 



1 r°° 

V±± = -r M{himi)hlA{t{^j)Pl{i), 

Jo 



(9) 

(10) 



The correspo nding expression s for V and M using the 2PCF 
are derived in ISchneider et al.l (l2002ah . In this paper we only 
need the expressions for the mixed term, which read 



2o-j r 

TiAn Jo 

2o-i r 

d^^+(|#cos(4^), 

nAn Jo 

x(W)"''f-(l*l)cos(4v), 
(2) where we denote |^| = 



M- 



M+ 



(11) 



(12) 



(13) 
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3. Variation of covariances in parameter space 

We select a two-dimensional parameter grid with 50 x 50 
gridpoints of e [0.2; 0.4] and erg e [0.6; 1.0]. For each 
grid point we calculate a covariance analytically using dU - 
([Tol l. The shear power spectra Pe are obtained from the den- 
sity power spectra employing Limber's equation. To derive 
Pg we assume an initial Harrison-Zeldovich power spectrum 
(Ps(k) PC k"' where «s = 1) with the transfer function from 



Efstathiou et al.l (Il992h . For the calculati on of the non-linea r 
evolution we use the fitting formula of ISmith et al.l (l2003b . 



Throughout this paper, we assume a flat universe and fix all 
cosmological parameters except Qm and cr^, more precisely 
Ho = 0.73 and Ob = 0.04. These values for Hq and fib together 
with fi,n = 0.25 and erg - 0.9 define our fiducial cosmological 
model, which we have c hosen similar to the c osmology of the 
Millennium Simulation (iSpringel et al.ll2005h for a later com- 
parison of Gaussian and ray-tracing covariances. We assume 
all source galaxies to be at redshift zo = 1 0. Using a redshift 
distribution instead would not change our results qualitatively. 
In addition to cosmology, the covariance depends on survey 
parameters. The scaling relations given in Sect. |3] are gener- 
ally valid and independent of survey parameters. In case of the 
likelihood analyses in Sects. |4]and|5]we choose, unless stated 
otherwise, an intrinsic ellipticity noise of cr^ = 0.4, a number 
density of source galaxies of n = 10/arcmin^ (similar to the 
values of the Dark Energy survey), and a survey which covers 
A = 900 deg^. The angular scale of the 2PCF data vector for 
which we calculate the covariances covers a range from 0. 1 ar- 
cmin to 180 arcmin, which is divided into 50 logarithmic bins. 
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Fig. 1. The dimensionless shear power spectrum { Pe- The 
solid curves correspond to variation in fim and erg: Qm - 0.2, 
erg = 0.6 (lower), Q„, = 0.3, erg = 0.8 (middle), = 0.4, 
erg = 1.0 (top). The dashed curves show variation in erg with 
Q,n = 0.25: erg = 0.6 (lower), erg = 0.8 (middle), era = 1.0 
(top). The dotted curves show variation in Q.^ with a constant 
o-g = 0.9: Q,n = 0.2 (lower), Q.^ = 0.3 (middle), Q.^ = 0.4 
(top). 



3.1. A fast method to calculate covariances for 
arbitrary ^nd o-g 

From ^ and (fTOb one directly sees that the covariance ma- 
trix depends on the cosmological model, which enters with the 
power spectrum Pe. Figure[T]illustrates the change in Pe when 
varying only or erg, and both parameters simultaneously; 
we see that it increases with as well as with erg . 
For a given cosmological model we can calculate the covari- 
ance directly from ^ - ( fTOl i. Performing this calculation for 
many sets of parameters is time-consuming; hence we seek a 
scaling relation, which relates the covariances of an arbitrary 
cosmology C;,- to a reference model C;r„. A basic theorem in 
statistics states (e.g. [Anderson 2003), that if there is a relation 
between two data vectors x and y which reads y - Ax (A be- 
ing a matrix), the relation of the covariances of x and y can be 
written as 

C, = (Cy - (y})(y - Cv))') 

= {(Ax - <Ajc))(Aa: - {Ax)y) 

= AC,A'. (14) 

In this derivation A must be independent of the ensemble av- 
erage. If we apply the above ansatz to the 2PCF, it seems rea- 
sonable to define a scaling relation for parameter dependent 
covariances as 



where we can calculate the scaling matrices A using the 2PCF 



diag(A) = ^^/^^, 



(16) 



In contrast to a covariance matrix, the 2PCF can be calcu- 
lated extremely fast for many different cosmologies via 
Hence, it would be a fast and convenient method to calcu- 
late the covariance for a reference cosmology and then apply 
(fTSl l to obtain covariances for arbitrary cosmological parame- 
ters. Unfortunately, we cannot transfer this method directly to 
the cosmic shear case. Recall, that the 2PCF is derived from 



Q.=AC. A', 



(15) 



the measured ellipticities of galaxies. [Schneider et al.l (l2002bh 
have shown that the intrinsic ellipticity terms cancel out in the 
derivation of the 2PCF estimator, hence the 2PCF is defined 
only in terms of the shear. In contrast, the 2PCF covariance 
does not only consist of terms coming from the shear, but has 
additional noise terms which arise from the intrinsic elliptic- 
ity of galaxies. The pure shot noise term ^ is independent of 
cosmology and, as can be seen from (II lH13b . the mixed term 
cannot be scaled with relation (flSl l. which is quadratic in the 
2PCF 

However, in the limit of a noise-free covariance, i.e. consid- 
ering only the cosmic variance term, a scaling relation similar 
to ( fTSl l exists. We explicitly prove this below, in particular, we 
show that the scaling matrices are independent of the ensemble 
average. The cosmic variance term can be calculated via (|9]l. 
Cosmology only enters with the power spectrum, hence the re- 
lation of TT to TTo can be described as Pe({, tt) = a({, n)PE((, JTo)- 
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Using this relation we transform the cosmic variance term (|9]l 
for given bins d-j as follows 



Jo 



where we discretize th e integral into a sum o f ^'-bins. Next we 
insert equation (26) of IJoachimi et al.l (120081) (see also iKaiser 
Il998l) but with cr, = 



{APEii)APEin) 

to rewrite (fTTI i as 



471 

A{A£ 



Pli{)5i 



(18) 



y±±i7:) = ■^_Y^AmhiAm)hiA{('&j)a{l7i)a(£',7:) 

IP 



:(APE(A?ro)APE(r,7ro)) 



(19) 



The mean value theorem guarantees that there exist values 
a(d-i, 7i), a(§j, 71) such that ( fT9] l becomes 



a{d^i,n)a(§j,7i:) 



Jo 



daJo/4(mi)APE(i,7ro) 



X dff]o/4({'^j)APE(f,7:o) 

Jo 

= a(i?;,;r)fl(??;,;r) V±±(;ro), 



(20) 



where we consider the limit Ai — > in the first step. 
Comparing the expressions of V±±(7r) and V'±±(;ro) we can cal- 
culate the scaling factors as 



a±±('&i,7r) 



(21) 



X diao/4m)PE(£,7ro) ^W.^ro) 

where we inserted ^ in the last step. This provides a fast and 
convenient method to scale the cosmic variance term in param- 
eter space, due to the fact we can use a computationally effi- 
cient Hankel transformation for the calculation of the 2PCF. 
From ( fTTT ) - ( fT3l l we see that the mixed term M±± scales lin- 
early with the 2PCF which prevents a scaling relation similar 
to ( fTSl l. Fortunately, the direct calculation of the linear term via 
( [Tol l is comparatively fast, therefore, the scaling relation for the 
cosmic variance term already reduces the computational costs 
significantly. 

Nonetheless, we numerically derive a fit-formula for the linear 
term based on the following expression 



025 



\0.9/ 



(22) 



The structure of this fit-formula is motivated by the intention 
to use as few fit-parameters as possible; additionally we re- 
quire that in the limit of the fiducial model, M±±(i?,, i?j,;r) = 
M±±{§i, §j, Ko) must hold. The fit-parameters a and yS vary de- 
pending on the scale j?,, &j and are different for the different 

parts of the covariance matrix, C++, C , and C+-. The tables 

with a and fi are available on the internefl 



3.2. Variation of tlie inverse covariance witli snd 

From the variation of the power spectrum with Qm and erg 
(Sect. l3.lT l it is clear that covariances vary with respect to como- 
logical parameters. For simplicity and in order to increase the 
readability of the following sections we refer to this variation 
as CDC-effect (CDC = Cosmology Dependent Covariances). 
In order to examine the CDC-effect more closely, recall that 
the structure of the covariance is given by 



C = 



C++ 



V ^+- 



(23) 



http://www.astro.uni-bonn.de/~teifler/fit-parameters.pdf 



and the individual parts are calculated from (|5]l - ( fTOl i. From 
these equations we see that the covariances are filtered versions 
of the power spectrum, either filtered by a product of Jo's (in 

case of C++), J4's (C ), or a combination of both (C+_). The 

strength of the CDC-effect depends on these filter functions, as 
they determine which parts of the power spectrum are sampled. 
A change in Oni and erg affects all scales of the power spec- 
trum almost similarly (see Fig.[T]i therefore, the CDC-effect for 
the individual parts of C is also similar However, this might 
change when considering different cosmological parameters, 
e.g. the shape parameter F. A change in F rotates the power 
spectrum. The covariances are integrals over Pe, and depend- 
ing on the filter function, the change in Pe can average out. A 
second argument why the individual covariance parts have dif- 
ferent sensitivity to the CDC-effect is that C+_ is not affected 
from shot noise, hence a change in cosmology has a stronger 
impact on C+_ compared to C++ and C— . 
In order to quantify the CDC-effect we examine the trace of the 
inverse covariance matrix C"'. The trace of the covariance it- 
self is an improper measure for this effect, as it depends on the 
binning, which can be seen from The trace of C becomes 
arbitrarily large when decreasing the bin width. In contrast, we 
checked numerically that for the trace of C ' binning effects 
are negligible, once one has exceeded a minimum bin number. 
More precisely, once the bin width of the 2PCF data vector 
is small enough that discretization effects are unimportant, the 
trace of C ' hardly changes for different binning. 
Figure 12] shows the trace of the inverse covariance matrix de- 
pending on for various constant erg (top) and vice versa 
(bottom). Here, we normalize the survey size to A = 1 deg^; 
the other survey parameters are cr^ =0.4 and fi - 10/arcmin'^. 
We postpone a detailed analysis of how survey parameters in- 
fluence the CDC-effect to Sect. |4] Qualitatively the result does 
not change for different survey parameters; the trace of C ' de- 
creases with increasing flm or erg. 

In addition, we perform a singular value decomposition (SVD) 
for each inverse covariance matrix. For the case of a symmet- 
ric and positive definite matrix, such as the inverse covariance 
matrix, an SVD yields the eigenvalues in decreasing order. For 
arbitrary i, we find that the i-th eigenvalue decreases when in- 
creasing Qm or (Tg. The strength of the CDC-effect, i.e. the gra- 
dient of the traces, depends on the considered point in parame- 
ter space. 
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Fig. 2. The trace of the inverse covariance matrix C ' depend- 
ing on Oni (top), the individual lines in each figure corre- 
spond to (from top to bottom) erg - [0.6,0.7,0.8,0.9,1.0]. 
The lower panel shows the dependence on erg, the indi- 
vidual lines corresponding to (from top to bottom) = 
[0.2,0.25,0.3,0.35,0.4]. 

4. Impact of the CDC-effect on parameter 
estimation 

4.1. Basics of the likelihood analysis 

Throughout the whole likelihood analysis we assume the 
ACDM model. We define the posterior likelihood p{7T\^) for 
the case of a 2PCF data vector as 

where pin) denotes the prior probability density, ^(^l^r) is the 
likelihood and p{^) denotes the evidence. The prior usually 
contains knowledge on the parameter vector tt coming from 
former experiments. Here, we assume flat priors with cutoffs, 
which means p{7r) is constant for all parameters inside a fixed 
interval (i.e. Q„ e [0.2; 0.4], o-g E [0.6; 1.0]) and p{n) = else. 
We assume that ^ is normally distributed in parameter space, 
hence our likelihood pi^ln) can be written as 



exp [-i ((^.-1)' C-i (^, 
(2;ry/2 |C|i 



(25) 



where ^ denotes the mean data vector, ^„ the model data vector, 
d is the dimension of the data vectors, hence |C| is the determi- 
nant of a dxd covariance matrix. Note that the 2PCF data vector 
consists of two parts (^+ and ^_), each with d/2 bins. The evi- 
dence is a normalization obtained by integrating the likelihood 
over the considered parameter space 



P(€) 



exp[-i ((^.-1)' (^,- 
{Ijryn- \C\i 



(26) 



In our case we calculate ^ from Pe via (O assuming our fidu- 
cial cosmology; is calculated similarly but its cosmological 
model varies according to the considered point in parameter 
space. The result of a likelihood analysis is usually summa- 
rized in contour plots. In a Bayesian approach, these likelihood 
contours represent so-called credible regions, i.e. a region in 
parameter space, where the true parameter is located with a 
probability of 68%, 95%, 99,9%, respectively. In addition, we 
quantify the size of these credible regions through the deter- 
min ant of the second-order mome nt of the posterior likelihood 
(see lKilbinger & Schneidej|2004 



(27) 



with TTi and 712 as the varied parameters, 7r[ as the parameter of 
the fiducial model. The determinant is given by 



\Qij\ = ^QuQii-Qlo- 



(28) 



Smaller credible regions in parameter space correspond to a 
smaller value of q. In this paper all q are given in units of 10 

4.2. Results of the likelihood analysis 

In Sect.|3]we calculate 2500 covariances covering a parameter 
range of Q.^ e [0.2; 0.4] and erg e [0.6; 1.0]. Here, we want 
to examine how the CDC-effect influences the likelihood con- 
tours, hence for each of the 2500 covariance matrices we per- 
form a likelihood analysis. In these analyses, we consider the 
same parameter space, similar priors, similar ^ and ^j^, only 
the covariance in ( l25T l is changed. The left panel of Fig. [3] 
shows the 95%-credible intervals when choosing Qm = 0.2 and 
cTg = 0.6 (solid). On, = 0.4, and erg - 1.0 (dotted) as a model 
for the covariance matrix. We compare these to the (dashed) 
case when the covariance is calculated from the fiducial model 
(fim = 0.25, (Tg = 0.9). These examples illustrate that assum- 
ing different cosmologies in the covariance can significantly 
broaden or narrow the likelihood contours. As expected from 
the foregoing analysis of the inverse covariance traces (Sect. |3]l 
the contours broaden for increasing Oni and erg. 
Without any information which cosmology to choose in our 
covariance matrix, it is reasonable to include prior informa- 
tion coming from other cosmological probes into our covari- 
ance cosmology. The middle panel of Fig. |3] shows the 95% 
credible intervals when calculating the covariance from the 
minimum, mean, and maximum values of the 68% confidence 



regio n of the recent WMAP 5-years analysis ( IKomatsu et al 



20081). Compared to the left panel the deviation of the contours 
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0.20 0.25 0.30 0.35 0.20 0.25 0.30 0.35 0.20 0.25 0.30 0.35 

flm i^rn ^^m 



Fig. 3. The 95%-credible intervals obtained from likelihood analyses with different cosmological models assumed in their co- 
variance matrix. The left panel corresponds to the following covariance parameters: - 0.2, erg = 0.6 (solid), 0„, - 0.25, 
(Ts - 0.9 (dashed), and 0^ = 0.4, erg = 1.0 (dotted). The middle panel shows the deviation which occurs when restricting the 
range of possible covariance models to the 68% confidence interval of the WMAP5 analysis, i.e. Qn, = 0.237, erg = 0.74 (solid), 
£lm - 0.259, erg - 0.796 (dashed), and Oni = 0.274, erg - 0.85 (dotted). The right panel shows the same analysis but for the 
95% confidence interval of the WMAP5 analysis, i.e. D.^ = 0.226, erg = 0.70 (solid), Qn, = 0.237, erg = 0.74 (dashed), and 
On, = 0.288, erg = 0.885 (dotted). 



Table 1. Values of q for different covariance models 



parameters used for the covariance 


q [10-"] 




= 0.25, 0-8 = 0.9 


1.03 




= 0.2, 0-8 = 0.6 


0.28 




= 0.4,0-8 = 1.0 


3.30 




= 0.259, o-g = 0.796 (WMAP5 68 % CL mean) 


0.75 




= 0.237, erg = 0.740 (WMAP5 68 % CL min) 


0.56 




= 0.274, erg = 0.850 (WMAP5 68 % CL max) 


1.02 


n„ 


= 0.226, o-g = 0.700 (WMAP5 95 % CL min) 


0.45 




= 0.288, o-g = 0.885 (WMAP5 95 % CL max) 


1.24 



reduces significantly, nevertheless it is still noticeable and can- 
not be neglected in a precision cosmology analysis. Similarly, 
the right panel shows the impact of the CDC-effect when cal- 
culating the covariance from parameters within the 95% confi- 
dence region of the recent WMAP5 analysis. For a better com- 
parison we calculate the values of q (Sect. 14.1b for all contour 
plots and summarize them in Table [1] Restricting the possible 
cosmologies for the covariance to the 68% contour region of 
the WMAP5 analysis, the values of q deviate by a factor of 
^ 1.84. This factor increases to x 2.76 when considering the 
minimum and maximum values of the 95% confidence region 
of the WMAP5 constraints. In Fig. |4] we show the values of 
q for all 2500 likelihood analyses depending on Qn, (top) and 
cTg (bottom). Similar to the parameter dependence of the in- 
verse covariances in Sect. |3] the strength of the CDC-effect, 
i.e. the gradient of the curves in Fig. HI depends on the con- 
sidered point in parameter space. At the fiducial model we 
calculate (dq/dD.ta)fid - 7.5, whereas in case of erg we find 
(dq/do-s)M = 3.5. 



4.3. Impact of survey parameters on the CDC-effect 

In the last section we have shown, that the CDC-effect non- 
negligibly affects the likelihood contours. However, we only 
quantify this for one specific set of survey parameters. In this 
section we examine how the impact of the CDC-effect on like- 
lihood contours depends on survey parameters, namely survey 
size A, ellipticity dispersion er^, and number density of source 
galaxies n, where in case of the latter two only the combination 
er^/n is of interest. We perform likelihood analyses for 9 dif- 
ferent combinations of cr^/n and 8 different survey sizes. The 
strength of the CDC-effect is quantified by the ratio of maxi- 
mum to minimum value of q, which occur within the consid- 
ered range of Qn, and erg, we define Aq - <?(7rn,ax)/'?(?rn,in)- The 
minimum q is obtained when choosing the minimum parame- 
ter set in the calculation of the covariance, i.e. ;rn,in = (^^m - 
0.2, erg - 0.6). Correspondingly, choosing the maximum pa- 
rameter set TTmax - i^m = 0.4, erg = 1 .0) rcsults in the maximal 
q. The values of q represent the size of credible intervals, hence 

can be interpreted as their ratio. 
Unfortunately, it is not possible to derive an analytical expres- 
sion for the relation between A^ and the survey parameters. 
From ^ - ([Tol l we see that the individual covariance terms 
scale differently with cr^/n. This already prohibits an analyti- 
cally derived relation between A^ and er^/fz. Considering the 
survey size A, dH) - (fTOl i imply that the total covariance scales 
with I /A. When comparing two (inverse) covariances with dif- 
ferent cosmologies by taking their ratio, the survey size can- 
cels, suggesting the strength of CDC-effect to be independent 
of A. However, when considering the likelihood, the inverse 
covariance enters in the exponent, furthermore the values of q 
are an integral over the posterior likelihood. This non-linearity 
in the inverse covariance causes that the strength of the CDC- 
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Fig. 4. The values of q depending on (top), the individ- 
ual lines in each figure correspond to (from top to bottom) 
erg - [0.6, 0.7, 0.8, 0.9, 1.0]. The lower panel shows the depen- 
dence on erg, the individual lines corresponding to (from top to 
bottom) = [0.2,0.25,0.3,0.35,0.4]. 



effect varies with the survey size. An analytic expression of 
this dependence cannot be derived, for similar reasons as for 
the case of cr^/«. We therefore calculate Aq depending on the 
survey parameters numerically. 

The upper panel of Fig. |5] shows Aq - q(nm?Lid I qi'^mm) as a 
function of cr^/«. The ratio changes from 4 to 18 over the 
considered interval of cr^/«. When increasing the survey size A 
(Fig lD, lower panel), we find that the impact of the CDC-efFect 
increases from Aq - 2.5 (for a 25 deg^ survey) up to l^q - 12.3 
(for a 2500 deg^ survey). Note that the size of the likelihood 
contours, hence the values of q themselves, decrease with de- 
creasing crlln and increasing A. In contrast, Aq increases with 
decreasing cr^/« and increasing A. Hence relatively, the CDC- 
effect becomes more important when increasing the survey size 
or when decreasing the ratio cr^/«. 



Fig. 5. The ratio of maximum to minimum value of q depending 
on the ratio cr^/n {upper panel) and depending on the survey 
size A {lower panel). 

5. Likelihood analysis with a model dependent 
covariance 

5.1. Adaptive covariance matrix 

For a given cosmological model we can calculate the covari- 
ance directly from (|5]l - (fTOl i. This enables us to perform a like- 
lihood analysis, where the covariance is calculated individually 
for every point in parameter space. We denote this parameter 
dependent covariance as and rewrite the likelihood (|25] ) as 



p{m 



exp[-^ ((^.-1)' C,' (^. 



(29) 



Compared to the case of a constant covariance, there are two 
main differences. First, the covariance in the exponential term 
of (l29T l changes according to the considered point in parame- 
ter space. Second, |C;r|" is now parameter dependent, therefore 
the determinant no longer cancels with a similar term in the evi- 
dence. As a consequence, the posterior likelihood does not only 
depend on the exponential terms, which basically compare 
and ^, but it is also aff'ected by the determinants of the covari- 
ance matrices, more precisely by their behavior in parameter 
space. In the following we quantify the impact of the determi- 
nant term. 
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The upper left panel in Fig.|6]shows the likelihood contours for 
a 84 deg^ survey, where the posterior probability is calculated 
via the new likelihood (|29T l. For comparison, the right panel 
shows the likelihood contours when neglecting the parameter 
dependence in the determinant terms, hence considering a pa- 
rameter dependent covariance only in the exponential terms. 
One clearly sees that the determinant terms shift the likelihood 
contours and cause a difference between the best-fit value and 
the fiducial model. In order to explain this shift we overlay the 
right panels of Fig. |6] with the contours of constant IC^rT'^^ 
(for numerical reasons we plot In IC^rT'^^). We see, that the co- 
variance determinant is a monotonic function of Qm and erg; 
it decreases with increasing Q,n or erg. Hence, IC^rT^ induces a 
parameter-dependent weighting, which increases the likelihood 
at small Q.^ and erg and vice versa suppresses large Qm and erg. 
In general, the exponential term dominates the likelihood, 
IC^tT'^" only has significant impact on parameter regions where 
the exponential hardly changes. For the highly degenerate case 
of On, and erg, this applies to curves where erg ^ const x Q„,'''^. 
Compared to these curves, the contours of constant IC^rl '^^ are 
slightly rotated, which allows for different values of the latter in 
regions where the exponential term is constant. As a result, the 
likelihood contours in the left panel are shifted and stretched 
towards regions of larger |C;n-r'^^ compared to the right panel. 
Note that for a different parameter combination this bias might 
not cause such a large shift of the best-fit value. 
The second row of Fig.|6]shows the same analysis but for a 900 
deg^ survey. Comparing the left and right panel, we see that the 
likelihood contours are, similar to the 84 deg^ survey, shifted 
and stretched towards regions of larger |C^r'^^. However, the 
effect is hardly noticeable and the bias of the best-fit value has 
basically vanished. This can be explained when looking at the 
expression of the posterior likelihood 



where L - - In ^(^rl^), n - (tti, ...,n„) describes the underly- 
ing (cosmological) parameters, and ttml denotes the maximum 
likelihood parameter vector. 

In addition, if we Taylor-expand L{7r) in parameter space at the 
fiducial parameters we derive 

L(n) = LiTiM) + + i(7r - mdf (ti - tim) + O(A^) , (32) 



with 



(T 



>U 



dni dn , 



(33) 



The first-order term vanishes since {dL/d7:)\,rM zero, hence 
( |32] | is dominated by second-order terms. In this analysis we 
only consider and erg; timl corresponds to our fiducial 
model, i.e. ttm = (f^m = 0.25, erg - 0.9). Comparing ( |33] ) and 
dSTT i one sees that the Fisher matrix and the inverse parameter 



(34) 



covariance matrix T are equal. We rewrite ( l32l ) as 



L{71m) + ^(^ - ^fid)' F (;r - tim) + 0{Al) . 



This equation enables us, for a given Fisher matrix, to calculate 
lower bounds for L{n), hence we can derive lower bounds on 
the likelihood contours. For the case that p{^\7t) is Gaussian, 
which at least close to the maximum likelihood parameter vec- 
tor is a good approximation {p{^\7:) oc exp[-L]), one can di- 
rectly express the Fisher matrix in ter ms of the m ean data vec- 
tor and the data covariance matrix (e.g. Tegmarket al. 19971) 



',V=^tr[c-'C,C-'C, + C-'M,.,] 



(35) 



Pirn 



exp[-l ((^.-1)' C,' (#.-!))] 
/ d;r'|C^; CJ3 exp [-\ [(^,, - 1)' C^/ {^,, 



.(30) 



Compared to the case of a constant covariance the above 
expression has an additional factor in the denominator, i.e. 
|C;rCj^,'|'^~. Note, that this factor is independent of the sur- 
vey size A, whereas the importance of the exponential term 
increases with increasing A. As a result, the cosmology de- 
pendence of the covariance determinant becomes negligible for 
sufficiently large surveys. 

5.2. Fisher matrix analysis 

We expect tighter constraints on cosmological parameters if 
the cosmology dependence of both, mean data vector and 
covariance matrix, is incorporated into the likel ihood analysis. 



instead of only using the mean data vector (iTegmark et al 
19971). The Fisher information matrix can be used to illus 



trate this effect; its definition reads (iKendall & Stuart Il979t 
Tegmarketalll 1997b 



Fo- - ' 



d^L{n)\ Id^Lin) 



(31) 



where C,, = dC/dni denotes the derivative of the covariance 
matrix with respect to the i-th component of the parameter vec- 
tor and Mij s ^j^^j + The first term of i35[ vanishes 
in case the covariance matrix is constant in parameter space, 
the second term vanishes in case of a constant mean. For cos- 
mic shear we have seen that neither the mean data vector, nor 
the covariance matrix are independent of cosmological parame- 
ters, hence when calculating the Fisher matrix both terms must 
be taken into account. Recall that C oc 1 /A, which also holds 
for the derivatives C_„ hence the first term is independent of 
the survey size. The second term increases proportional to the 
survey volume, therefore the information gain on cosmological 
parameters, through incorporating the cosmology dependence 
of covariances, becomes less important for large surveys. 
Figure|7]shows the results of the Fisher matrix analysis for two 
different survey sizes (84 deg^ on the left and 900 deg^ on 
the right). As expected, the left panel (smaller survey) shows 
a small improvement, which vanishes completely in case of the 
larger survey (right panel). Nevertheless, one should keep in 
mind that we only consider Gaussian covariances. The cosmol- 
ogy dependence of the covariance becomes larger for the case 
of non-Gaussian covariances for the following reason. Non- 
Gaussianity increases the cosmic variance term, in particular it 
becomes important on small scales, which are still dominated 
by shot noise in the pure Gaussian case. As the CDC-effect 
mainly results from the cosmic variance term, its strength also 
increases in the non-Gaussian case. A stronger dependence of 
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Fig. 6. The left plots shows the likelihood 
contours when using a model- dependent 
covariance, more explicitly, when calculat- 
ing the posterior from ([29]l. The cross il- 
lustrates the best-fit value, whereas the cir- 
cle indicates our fiducial model. The pan- 
els on the r.h.s. show the likelihood contours 
obtained when neglecting the determinant- 
terms l |29t . The dotted contours visualize re- 
gions of constant In |C;r'|"''^. The likelihood 
contours in the upper row correspond to a 
survey size of 84 deg^, whereas the lower 
panels con'espond to A = 900 deg^. 




0.30 



0.30 



Fig. 7. Likelihood contours from a Fisher 
matrix analysis for a 84 deg" survey (left), 
and for a 900 deg" (right). The dashed lines 
correspond to the same analysis but neglect- 
ing the covariance term. The dot indicates 
the fiducial model at which the Fisher ma- 
trix was calculated. Note, that in the right 
panel dashed and solid contours are identi- 
cal 



the covariance on parameters enlarges the first term in ( |35] |. 
which implies that for the case of truly non-Gaussian covari- 
ances the improvement on parameter constraints is more sig- 
nificant than shown in Fig. Q 

5.3. Iterative likelihood analysis 

In Sect. l5.ll we have introduced the adaptive covariance, which 
is a proper way to incorporate cosmology dependent covari- 
ances into a likelihood analysis. Its disadvantage is the large 
computational effort, which is high already for Gaussian co- 
variances. In order to account for non-Gaussianity, one must 
employ ray-tracing covariances derived from many numerical 
simulations with different underlying cosmologies. In a multi- 
dimensional parameter space, this is clearly unfeasible with to- 



day's computer power. 

In this section we quantify the impact on likelihood contours 
when using non-Gaussian instead of Gaussian covariances. We 
use a ra y-tracing covarianc e taken from the Millennium sim- 



ulation jHilbertet al.l 120081) . neglect the CDC-effect and ap- 



proximate the covariance to be constant in parameter space. 
The error in the posterior likelihood caused by this approxi- 
mation increases with increasing distance to the cosmology of 
the ray-tracing simulation. As we are mainly interested in re- 
gions around the maximum likelihood parameter set, ttml, this 
suggests the following strategy for a likelihood analysis. First, 
perform an iterative likelihood analysis using Gaussian covari- 
ances in order to derive ttml- Then, start a numerical simulation 
with this cosmology, derive a ray-tracing covariance, and per- 
form the final likelihood analysis. This ansatz minimizes the 
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Table 2. The ML-parameter sets which occur when choosing different starting cosmologies in the iterative likelihood analysis. 



step 


runl 

fi,n erg 


run2 


run3 

fin, O-g 


run4 


run5 

as 


^start 

^ML 2 
^ML 3 

mL 4 


0.20 0.60 
0.254 0.892 
0.260 0.882 
0.260 0.882 
converged 


0.237 0.740 
0.254 0.892 
0.260 0.882 
0.260 0.882 
converged 


0.250 0.90 
0.245 0.914 
0.245 0.914 
converged 
converged 


0.274 0.850 
0.259 0.884 
0.260 0.882 
0.260 0.882 
converged 


0.40 1.0 
0.277 0.858 
0.259 0.884 
0.260 0.882 
0.260 0.882 




q=0.44 




0.20 0.22 0.24 0.26 0.28 0.30 0.32 0.20 0.22 0.24 0.26 0.28 0.30 0.32 



Fig. 8. The likelihood contours when us- 
ing a ray-tracing covariance derived from 
the Millennium Simulation via field-to-field 
variation (left panel), compared to the case 
of a Gaussian covariance (right panel). 
Although the original size of each field is 
only 16 deg^, we extrapolated the covari- 
ance to a 900 deg^ survey. The values of q 
are given in units of 10"*. 



errors due to the CDC-effect in the region of interest and addi- 
tionally incorporates non-Gaussianity. 

In order to derive ttml iteratively, we start from an arbitrary 
cosmology, calculate a Gaussian covariance matrix therefrom 
using (|5]l - ( flOl ). and perform a likelihood analysis. Throughout 
this first iteration step the covariance matrix is kept constant. 
In the second step we choose the ML-parameter set of the first 
analysis as the underlying cosmology for the new covariance 
matrix, and again perform a likelihood analysis. We continue 
this iteration process until the ML-parameter set converges. 
The main difficulty of this ansatz is that the choice of the start- 
ing cosmology might influence the final ML-parameter esti- 
mate and therefore also the final covariance. In order to check 
for this, we take the noise of a ray-tracing data vector, add it to 
our fiducial data vector and thereby simulate measurement un- 
certainties in the latter. When performing the analysis without 
noise the iteration converges after one step, as the model data 
vector (at the fiducial model) exactly fits the fiducial data vec- 
tor, ^jtm - Table|2]shows the results for 5 iterative likelihood 
analyses, each starting from a different cosmology in the co- 
variance. We see that all 5 runs converge quickly, 4 of them to 
the same cosmology. Only the run which started from the fidu- 
cial model deviates from the others. Although the suggested 
ttml are close to ttm, we note that none of the runs converges to 
the fiducial model. This implies that the starting cosmology can 
bias the final outcome of the iterative likelihood analysis and 
can shift the ML-estimate. In general, such a bias occurs if the 
function ^^-^ does not fall off steeply enough around the ML- 
parameter set, which especially applies to higher-dimensional 
likelihood analyses. 

Our iterative pre-analysis has converged to 0^ - 0.26, erg = 



0.882, however we "only" have a ray-tracing simulation with 
0,n = 0.25, erg - 0.9 available. Figure [8] shows the result of 
our likelihood analysis, when using the ray-tracing covariance 
of the Millennium simulation (left panel). Compared to a like- 
lihood analysis using a Gaussian covariance (right panel), the 
contours broaden significantly; q increases from 0.44x10""* in 
the Gaussian to 0.78x10""* in the non-Gaussian case. Note that 
the value of q in the Gaussian case does not correspond to that 
in Table [T] because we use different survey parameters (here, 
cTj = 0.3, n - 15/arcmin^) and a different data vector (here, 
30 logarithmic bins from 0.2-130 arcmin) in order to exactly 
match the corresponding parameters of the ray-tracing covari- 
ance. 

The impact of non-Gaussianity depends on the scales probed 
by the data vector. In our case 20 bins are below 10 arcmin, 
therefore the impact is relatively high. Choosing linear bins or 
probing higher § reduces the difference to the Gaussian case. 
For the data vector considered here, this difference is of the 
same order as the impact of the CDC-effect we described in 
Sect. 14.21 However, the strength of the latter will most likely 
increase for non-Gaussian covariances, as we explained at the 
end of the last section. 

6. Conclusions 

An accurate likelihood analysis plays an essential role in future 
precision cosmology. We can only exploit the full potential of 
upcoming high quality data, if we use appropriate statistical 
methods. In this context the derivation of covariances is an im- 
portant issue in order not to bias the parameter constraints. 
In cosmic shear, there are several methods to derive covari- 
ances. First, one can calculate C analytically assuming a 
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Gaussian shear field. This assumption breaks down on small 
angular scales (< 10 arcmin), where non-linearities of the mat- 
ter density field start to become important. Second, covari- 
ances can be estimated from ray-tracing simulations. Although 
computationally more expensive, this method automatically ac- 
counts for the non-Gaussianity of the shear field. In both meth- 
ods the covariance is calculated assuming a specific cosmology. 
In the first case, this cosmology enters in the power spectrum 
from which C is calculated, in the second case we estimate C 
from numerical simulations, which are also based on a given 
cosmology. Past cosmic shear data analyses approximate the 
covariance to be constant in parameter space, therefore assume 
that its underlying cosmology does not influence the result of a 
likelihood analysis significantly. 

In this paper we have shown that the covariance matrix depends 
non-negligibly on its underlying cosmology and that this CDC- 
effect significantly influences the likelihood contours of param- 
eter constraints. To prove this, we calculate 2500 Gaussian co- 
variance matrices for various parameters of 6 [0.2; 0.4] 
and cTg e [0.6; 1.0]; all other cosmological parameter are held 
fixed. Even a change of Q.^ and erg within the WMAP5 68% 
confidence levels has a non-negligible impact on the likelihood 
contours. Here, the value of q deviates by a factor of 1.84 and 
this deviation increases to 2.76 if one considers the WMAP5 
95% confidence levels. Furthermore, we show that the impact 
of the CDC-effect depends on survey parameters. Although the 
likelihood contours become smaller, relatively the CDC-effect 
becomes more important when increasing the survey size or 
when decreasing the ratio o-^/fi. Therefore, a proper treatment 
becomes more important in the future, for large and deep sur- 
veys. 

To take cosmology dependent covariances into account we 
present two methods. First, we perform a likelihood analysis 
with an adaptive covariance matrix. Here, C is calculated indi- 
vidually for every point in parameter space, assuming the cor- 
responding parameters as the underlying cosmology. For small 
surveys this method introduces a bias to the best-fit parameter 
set, which vanishes when going to larger survey sizes. A disad- 
vantage of this approach is its computational costs. Using the 
analytic expression for Gaussian covariances is already time- 
consuming, using ray-tracing covariances to include the non- 
Gaussianity is not feasible with today's computer power. For 
the Gaussian case we present a fast and convenient scaling 
relation to derive covariances on a parameter grid. As a side- 
effect this approach enhances the constraints on cosmology, for 
the reason that we now incorporate two cosmology dependent 
quantities into the likelihood analysis instead of only the mean 
data vector. 

In a strict sense the second method does not account properly 
for the CDC-effect, however it minimizes the error around the 
maximum likelihood parameter set (^Tml)- The method con- 
sists of two steps, first derive ^Tml through an iterative process, 
then derive a ray-tracing covariance with ttml as underlying 
cosmology and incorporate this in the final likelihood analy- 
sis. Here, the approximation of a constant covariance is made, 
however the error in the posterior probability is minimized 
in the region of interest; in addition, this ansatz incorporates 
non-Gaussianity which is non-negligible for future surveys. A 



drawback is the fact the the starting point of the iteration might 
bias ttml- This must be checked carefully before employing this 
method, otherwise the approximation of a constant covariance 
fails around ^Tml- 
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